Enrichment Procedures for Soft Clusters: A Statistical Test and its Applications
Abstract
Clusters, typically mined by modeling locality of attribute spaces, are often evaluated for their ability to demonstrate ‘enrichment’ of categorical features. A cluster enrichment procedure evaluates the membership of a cluster for significant representation in pre-defined categories of interest. While classical enrichment procedures assume a hard clustering definition, in this paper we introduce a new statistical test that computes enrichments for soft clusters. We demonstrate an application of this test in refining and evaluating soft clusters for classification of remotely sensed images.
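For context, the classical hard-cluster enrichment procedure that the abstract contrasts against is often implemented as a one-sided hypergeometric tail test. The sketch below illustrates that baseline only; it is not the soft-cluster test introduced in the paper, and the function name, parameter names, and example counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_pvalue(population, category_total, cluster_size, cluster_hits):
    """One-sided enrichment p-value for a hard cluster.

    Returns P(X >= cluster_hits), where X is the number of category members
    seen when cluster_size items are drawn without replacement from a
    population that contains category_total members of the category.
    """
    upper = min(category_total, cluster_size)
    denom = comb(population, cluster_size)
    # Sum the hypergeometric probability mass over the upper tail.
    tail = sum(
        comb(category_total, k) * comb(population - category_total, cluster_size - k)
        for k in range(cluster_hits, upper + 1)
    )
    return tail / denom


# Hypothetical counts: 20 items overall, 5 of them in the category of interest,
# and a cluster of 5 items of which 3 belong to that category.
p_value = hypergeom_enrichment_pvalue(
    population=20, category_total=5, cluster_size=5, cluster_hits=3
)
print(f"enrichment p-value: {p_value:.4f}")
```

A small p-value suggests that the category is over-represented in the cluster relative to chance. This formulation needs integer membership counts, which is why it does not carry over directly to soft clusters with fractional memberships.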
Similar resources
The Feasibility of Industrial Production of Lipases with an Emphasis on Its Applications in Food Enrichment
Background: Lipases are the most flexible biocatalysts, and they catalyze a wide range of bioconversion reactions. These enzymes have beneficial effects on food substrates such as natural oils, synthetic triglycerides and fatty acids. Lipases are used in a wide range of modern biotechnology industries, such as the synthesis of biopolymers, biodiesel and the pharmaceutical industry, in addition to use in...
FUZZY SOFT SET THEORY AND ITS APPLICATIONS
In this work, we define a fuzzy soft set theory and its related properties. We then define a fuzzy soft aggregation operator that allows a more efficient decision-making method to be constructed. Finally, we give an example showing that the method can be successfully applied to many problems that contain uncertainties.
ON SOFT ULTRAFILTERS
In this paper, the concept of soft ultrafilters is introduced, and some related structures are studied, such as the soft Stone-Čech compactification, principal soft ultrafilters, and a basis for its topology.
Continuous Iterative Guided Spectral Class Rejection Classification Algorithm: Part 1
This paper outlines the changes necessary to convert the iterative guided spectral class rejection (IGSCR) classification algorithm to a soft classification algorithm. IGSCR uses a hypothesis test to select clusters to use in classification and iteratively refines clusters not yet selected for classification. Both steps assume that cluster and class memberships are crisp (either zero or one). I...
Weighted Kolmogorov Smirnov testing: an alternative for Gene Set Enrichment Analysis.
Gene Set Enrichment Analysis (GSEA) is a basic tool for genomic data treatment. Its test statistic is based on a cumulated weight function, and its distribution under the null hypothesis is evaluated by Monte Carlo simulation. Here, it is proposed to subtract from the cumulated weight function its asymptotic expectation and then scale the result. Under the null hypothesis, the convergence in distribution of...